Data Surveying: Foundations of an Inductive Query Language
نویسنده
چکیده
Data mining systems have to evolve from a set of specialised routines to more generally applicable inductive query languages to satisfy industry’s need for strategic information. This paper introduces such an inductive query language called Data Surveying. Data Surveying is the discovery of "interesting subsets" of the database. Groups of customers whose behaviour deviates from average customer behaviour are exampies of such interesting subsets. A user specifies what makes a subset interesting through a survey task. The wide applicability of this scheme is illustrated by a variety of examples. To implement aa inductive query language system, the ’~vhat" (the kind of strategic information sought) has to be made independent from the "how" (how this strategic information is discovered). In other words, the discovery algorithms have to be task independent. In this paper, operators on the search space are introduced to achieve this independence. The discovery algorithms are defined relative to these operators. To enforce efficient discovery, the notion of polynomial convergence is defined for these algorithms. Domain knowledge plays an important role in the specification of both the survey task and the operatots.
منابع مشابه
انتخاب مناسبترین زبان پرسوجو برای استفاده از فراپیوندها جهت استخراج دادهها در حالت دیتالوگ در سامانه پایگاه داده استنتاجی DES
Deductive Database systems are designed based on a logical data model. Data (as opposed to Relational Databases Management System (RDBMS) in which data stored in tables) are saved as facts in a Deductive Database system. Datalog Educational System (DES) is a Deductive Database system that Datalog mode is the default mode in this system. It can extract data to use outer joins with three query la...
متن کاملAn Inductive Logic Programming Query Language for Database Mining
First, a short introduction to inductive logic programming and machine learning is presented and then an inductive database mining query language RDM (Relational Database Mining language). RDM integrates concepts from inductive logic programming, constraint logic programming, deductive databases and meta-programming into a flexible environment for relational knowledge discovery in databases. Th...
متن کاملA Logic-Based Approach to Mining Inductive Databases
In this paper, we discuss the main problems of inductive query languages and optimisation issues. We present a logic-based inductive query language and illustrate the use of aggregates and exploit a new join operator to model specific data mining tasks. We show how a fixpoint operator works for association rule mining and a clustering method. A preliminary experimental result shows that fixpoin...
متن کاملChapter 2 CONSTRAINT - BASED DATA MINING
Knowledge Discovery in Databases (KDD) is a complex interactive process. The promising theoretical framework of inductive databases considers this is essentially a querying process. It is enabled by a query language which can deal either with raw data or patterns which hold in the data. Mining patterns turns to be the so-called inductive query evaluation process for which constraint-based data ...
متن کامل17 Constraint - based Data Mining
Knowledge Discovery in Databases (KDD) is a complex interactive process. The promising theoretical framework of inductive databases considers this is essentially a querying process. It is enabled by a query language which can deal either with raw data or patterns which hold in the data. Mining patterns turns to be the so-called inductive query evaluation process for which constraint-based Data ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995